© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
| Use a product if... | Build a custom solution if... | |----------------------------------------|----------------------------------------------------| | ... data is already at the right place | ... need to integrate data sources | | ... it works well for your use case | ... out-of-the-box products don’t work well enough | | ... the licensing costs are acceptable | ... development- and operation-costs are cheaper |
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
Each answer is different, standard assertions are tricky. Extracting meaning with help of LLMs not good enough (yet)
© Zühlke APAC SWEX+DX 2024
Preprocess documents by removing non-content information (headers, footers, tables).
Structured formats work best!
© Zühlke APAC SWEX+DX 2024
> https://www.llamaindex.ai/blog/evaluating-the-ideal-chunk-size-for-a-rag-system-using-llamaindex-6207e5d3fec5
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
> Azure Cognitive Search: Outperforming vector search with hybrid retrieval and ranking capabilities
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
Your RAG application has policies for different regions stored. The policies have a high overlap on the vector map, as they cover similar topics. The vector search struggles to retrieve the correct chunks based on the region requested.
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024
© Zühlke APAC SWEX+DX 2024